AITopics | fudan university

Collaborating Authors

fudan university

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

On the Nonasymptotic Scaling Guarantee of Hyperparameter Estimation in Inhomogeneous, Weakly-Dependent Complex Network Dynamical Systems

Yu, Yi, Hou, Yubo, Wang, Yinchong, Zhang, Nan, Feng, Jianfeng, Lu, Wenlian

arXiv.org Machine LearningJan-23-2026

Hierarchical Bayesian models are increasingly used in large, inhomogeneous complex network dynamical systems by modeling parameters as draws from a hyperparameter-governed distribution. However, theoretical guarantees for these estimates as the system size grows have been lacking. A critical concern is that hyperparameter estimation may diverge for larger networks, undermining the model's reliability. Formulating the system's evolution in a measure transport perspective, we propose a theoretical framework for estimating hyperparameters with mean-type observations, which are prevalent in many scientific applications. Our primary contribution is a nonasymptotic bound for the deviation of estimate of hyperparameters in inhomogeneous complex network dynamical systems with respect to network population size, which is established for a general family of optimization algorithms within a fixed observation duration. While we firstly establish a consistency result for systems with independent nodes, our main result extends this guarantee to the more challenging and realistic setting of weakly-dependent nodes. We validate our theoretical findings with numerical experiments on two representative models: a Susceptible-Infected-Susceptible model and a Spiking Neuronal Network model. In both cases, the results confirm that the estimation error decreases as the network population size increases, aligning with our theoretical guarantees. This research proposes the foundational theory to ensure that hierarchical Bayesian methods are statistically consistent for large-scale inhomogeneous systems, filling a gap in this area of theoretical research and justifying their application in practice.

artificial intelligence, bayesian inference, machine learning, (15 more...)

arXiv.org Machine Learning

2601.15603

Country:

North America > United States (0.67)
Asia > China (0.47)
Europe > United Kingdom > England (0.28)

Genre:

Research Report > New Finding (0.65)
Research Report > Experimental Study (0.45)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Epidemiology (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Add feedback

AnalogSeeker: An Open-source Foundation Language Model for Analog Circuit Design

Chen, Zihao, Zhuang, Ji, Shen, Jinyi, Ke, Xiaoyue, Yang, Xinyi, Zhou, Mingjie, Du, Zhuoyao, Yan, Xu, Wu, Zhouyang, Xu, Zhenyu, Huang, Jiangli, Shang, Li, Zeng, Xuan, Yang, Fan

arXiv.org Artificial IntelligenceNov-6-2025

In this paper, we propose AnalogSeeker, an effort toward an open-source foundation language model for analog circuit design, with the aim of integrating domain knowledge and giving design assistance. To overcome the scarcity of data in this field, we employ a corpus collection strategy based on the domain knowledge framework of analog circuits. High-quality, accessible textbooks across relevant subfields are systematically curated and cleaned into a textual domain corpus. To address the complexity of knowledge of analog circuits, we introduce a granular domain knowledge distillation method. Raw, unlabeled domain corpus is decomposed into typical, granular learning nodes, where a multi-agent framework distills implicit knowledge embedded in unstructured text into question-answer data pairs with detailed reasoning processes, yielding a fine-grained, learnable dataset for fine-tuning. To address the unexplored challenges in training analog circuit foundation models, we explore and share our training methods through both theoretical analysis and experimental validation. We finally establish a fine-tuning-centric training paradigm, customizing and implementing a neighborhood self-constrained supervised fine-tuning algorithm. This approach enhances training outcomes by constraining the perturbation magnitude between the model's output distributions before and after training. In practice, we train the Qwen2.5-32B-Instruct model to obtain AnalogSeeker, which achieves 85.04% accuracy on AMSBench-TQA, the analog circuit knowledge evaluation benchmark, with a 15.67% point improvement over the original model and is competitive with mainstream commercial models. Furthermore, AnalogSeeker also shows effectiveness in the downstream operational amplifier design task. AnalogSeeker is open-sourced at https://huggingface.co/analogllm/analogseeker for research use.

arxiv preprint arxiv, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2508.10409

Country: Asia > China (0.95)

Genre: Research Report > New Finding (0.67)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.87)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)

Add feedback

Learning to Seek Evidence: A Verifiable Reasoning Agent with Causal Faithfulness Analysis

Huang, Yuhang, Lin, Zekai, Zhong, Fan, Liu, Lei

arXiv.org Artificial IntelligenceNov-4-2025

Explanations for AI models in high-stakes domains like medicine often lack verifiability, which can hinder trust. To address this, we propose an interactive agent that produces explanations through an auditable sequence of actions. The agent learns a policy to strategically seek external visual evidence to support its diagnostic reasoning. This policy is optimized using reinforcement learning, resulting in a model that is both efficient and generalizable. Our experiments show that this action-based reasoning process significantly improves calibrated accuracy, reducing the Brier score by 18\% compared to a non-interactive baseline. To validate the faithfulness of the agent's explanations, we introduce a causal intervention method. By masking the visual evidence the agent chooses to use, we observe a measurable degradation in its performance ($Δ$Brier=+0.029), confirming that the evidence is integral to its decision-making process. Our work provides a practical framework for building AI systems with verifiable and faithful reasoning capabilities.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2511.01425

Country: Asia > China (0.15)

Genre: Research Report (0.65)

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Health & Medicine > Nuclear Medicine (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

LogReasoner: Empowering LLMs with Expert-like Coarse-to-Fine Reasoning for Automated Log Analysis

Ma, Lipeng, Li, Yixuan, Yang, Weidong, Zhou, Mingjie, Liu, Xinyi, Fei, Ben, Li, Shuhao, Sun, Xiaoyan, Jiang, Sihang, Xiao, Yanghua

arXiv.org Artificial IntelligenceSep-30-2025

Log analysis is crucial for monitoring system health and diagnosing failures in complex systems. Recent advances in large language models (LLMs) offer new opportunities for automated log analysis, leveraging their reasoning capabilities to perform tasks such as anomaly detection and failure prediction. However, general-purpose LLMs struggle to formulate structured reasoning workflows that align with expert cognition and deliver precise details of reasoning steps. To address these challenges, we propose LogReasoner, a coarse-to-fine reasoning enhancement framework designed to enable LLMs to reason log analysis tasks like experts. LogReasoner consists of two stages: (1) coarse-grained enhancement of expert thinking, where high-level expert thoughts are constructed from collected troubleshooting flowcharts and existing tasks to enable LLMs to formulate structured reasoning workflows and (2) fine-grained enhancement of specific steps, where we first fine-tune the LLM with task-specific stepwise solutions to enhance the LLM for instantiated reasoning, then employ the preference learning to calibrate the LLM's reasoning details from its mistakes, further strengthen the LLM's analytical granularity and correctness. We evaluate LogReasoner on four distinct log analysis tasks using open-source LLMs such as Qwen-2.5 and Llama-3. Experimental results show that LogReasoner significantly outperforms existing LLMs, achieving state-of-the-art performance and demonstrating its effectiveness in enhancing the reasoning capabilities of LLMs for log analysis.

large language model, logreasoner, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2509.20798

Country: Asia > China (0.17)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Towards Video Text Visual Question Answering: Benchmark and Baseline

Neural Information Processing SystemsAug-19-2025, 15:02:17 GMT

As mentioned in our paper, M4-ViteVQA has 9 categories.

category, natural language, question answering, (15 more...)

Neural Information Processing Systems

Country: Asia > China > Shanghai > Shanghai (0.05)

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.52)

Add feedback

EnTao-GPM: DNA Foundation Model for Predicting the Germline Pathogenic Mutations

Lin, Zekai, Sun, Haoran, Guo, Yucheng, Yang, Yujie, Wang, Yanwen, Hu, Bozhen, Ye, Chonghang, Yang, Qirong, Zhong, Fan, Zhang, Xiaoming, Liu, Lei

arXiv.org Artificial IntelligenceJul-30-2025

Distinguishing pathogenic mutations from benign polymorphisms remains a critical challenge in precision medicine. EnTao-GPM, developed by Fudan University and BioMap, addresses this through three innovations: (1) Cross-species targeted pre-training on disease-relevant mammalian genomes (human, pig, mouse), leveraging evolutionary conservation to enhance interpretation of pathogenic motifs, particularly in non-coding regions; (2) Germline mutation specialization via fine-tuning on ClinVar and HGMD, improving accuracy for both SNVs and non-SNVs; (3) Interpretable clinical framework integrating DNA sequence embeddings with LLM-based statistical explanations to provide actionable insights. Validated against ClinVar, EnTao-GPM demonstrates superior accuracy in mutation classification. It revolutionizes genetic testing by enabling faster, more accurate, and accessible interpretation for clinical diagnostics (e.g., variant assessment, risk identification, personalized treatment) and research, advancing personalized medicine.

bioinformatics, large language model, machine learning, (22 more...)

arXiv.org Artificial Intelligence

2507.21706

Country: Asia > China (0.30)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Genetic Disease (0.47)

Technology:

Information Technology > Biomedical Informatics (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Doctors reverse deafness, plus surprise Ozempic perks and rules for traveling with meds

FOX NewsJun-5-2024, 22:27:51 GMT

Five children who were born completely deaf have had their hearing loss reversed after an experimental treatment. The children had a hereditary form of deafness called DFNB9, which is caused by mutations in the OTOF gene. 'LIKE A MIRACLE' – Children with total deafness regained their hearing after receiving gene therapy. Doctors from Mass Eye and Ear in Boston and the Eye & ENT Hospital of Fudan University in Shanghai spoke with Fox News Digital about the groundbreaking trial. 'DANGEROUS IDEA' – Florida has become the first state to allow C-sections to be performed outside of hospitals.

artificial intelligence, deafness, surprise ozempic perk and rule, (9 more...)

FOX News

Country: Asia > China > Shanghai > Shanghai (0.27)

Industry:

Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.71)
Health & Medicine > Therapeutic Area > Genetic Disease (0.59)

Technology: Information Technology > Artificial Intelligence (0.80)

Add feedback

Global 4D Ionospheric STEC Prediction based on DeepONet for GNSS Rays

Cai, Dijia, Shi, Zenghui, Fu, Haiyang, Liu, Huan, Qian, Hongyi, Sui, Yun, Xu, Feng, Jin, Ya-Qiu

arXiv.org Artificial IntelligenceMar-12-2024

The ionosphere is a vitally dynamic charged particle region in the Earth's upper atmosphere, playing a crucial role in applications such as radio communication and satellite navigation. The Slant Total Electron Contents (STEC) is an important parameter for characterizing wave propagation, representing the integrated electron density along the ray of radio signals passing through the ionosphere. The accurate prediction of STEC is essential for mitigating the ionospheric impact particularly on Global Navigation Satellite Systems (GNSS). In this work, we propose a high-precision STEC prediction model named DeepONet-STEC, which learns nonlinear operators to predict the 4D temporal-spatial integrated parameter for specified ground station - satellite ray path globally. As a demonstration, we validate the performance of the model based on GNSS observation data for global and US-CORS regimes under ionospheric quiet and storm conditions. The DeepONet-STEC model results show that the three-day 72 hour prediction in quiet periods could achieve high accuracy using observation data by the Precise Point Positioning (PPP) with temporal resolution 30s. Under active solar magnetic storm periods, the DeepONet-STEC also demonstrated its robustness and superiority than traditional deep learning methods. This work presents a neural operator regression architecture for predicting the 4D temporal-spatial ionospheric parameter for satellite navigation system performance, which may be further extended for various space applications and beyond.

deeponet-stec model, observation data, stec value, (14 more...)

arXiv.org Artificial Intelligence

2404.15284

Country:

North America > United States > New York > New York County > New York City (0.14)
Asia > China > Shanghai > Shanghai (0.05)
Asia > China > Jiangsu Province > Nanjing (0.04)
(11 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Education (0.93)
Energy > Renewable (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.87)

Add feedback

Hate your nose? Blame your ancient cousins! Neanderthal DNA dictates the shape, study finds

Daily Mail - Science & techMay-8-2023, 09:01:17 GMT

It's something that many people are self-conscious of, and if you not a fan of your nose, we finally know who to blame. Scientists have revealed that Neanderthal DNA helps dictate the shape of your nose. A new study led by UCL researchers found that a particular gene, which leads to a taller nose, may have been the product of natural selection as ancient humans adapted to colder climates after leaving Africa. Dr Kaustubh Adhikari, who led the study, said: 'In the last 15 years, since the Neanderthal genome has been sequenced, we have been able to learn that our own ancestors apparently interbred with Neanderthals, leaving us with little bits of their DNA. 'Here, we find that some DNA inherited from Neanderthals influences the shape of our faces.

africa, homo sapien, natural selection, (14 more...)

Daily Mail - Science & tech

Country:

Africa (0.31)
Asia > East Asia (0.06)
South America > Peru (0.05)
(6 more...)

Genre: Research Report > New Finding (0.38)

Industry:

Health & Medicine > Therapeutic Area (0.32)
Health & Medicine > Pharmaceuticals & Biotechnology (0.32)

Technology: Information Technology > Artificial Intelligence (0.40)

Add feedback

Embedding Theory of Reservoir Computing and Reducing Reservoir Network Using Time Delays

Duan, Xing-Yue, Ying, Xiong, Leng, Si-Yang, Kurths, Jürgen, Lin, Wei, Ma, Huan-Fei

arXiv.org Artificial IntelligenceMay-8-2023

Reservoir computing (RC), a particular form of recurrent neural network, is under explosive development due to its exceptional efficacy and high performance in reconstruction or/and prediction of complex physical systems. However, the mechanism triggering such effective applications of RC is still unclear, awaiting deep and systematic exploration. Here, combining the delayed embedding theory with the generalized embedding theory, we rigorously prove that RC is essentially a high dimensional embedding of the original input nonlinear dynamical system. Thus, using this embedding property, we unify into a universal framework the standard RC and the time-delayed RC where we novelly introduce time delays only into the network's output layer, and we further find a trade-off relation between the time delays and the number of neurons in RC. Based on this finding, we significantly reduce the network size of RC for reconstructing and predicting some representative physical systems, and, more surprisingly, only using a single neuron reservoir with time delays is sometimes sufficient for achieving those tasks.

artificial intelligence, machine learning, reservoir neuron, (12 more...)

arXiv.org Artificial Intelligence

2303.09042

Country:

Asia > China > Shanghai > Shanghai (0.06)
Europe > Germany > Brandenburg > Potsdam (0.05)
North America > United States (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback